Skip to content

Conversation

danbev
Copy link
Member

@danbev danbev commented Aug 8, 2025

This commit updates llama_kv_cache_unified::find_slot to log information for all streams when debug is enabled.

The motivation for this change is that currently if a non-unified kv-cache is used, then only one stream will be logged because the code is currently using seq_to_stream[1].

This commit updates `llama_kv_cache_unified::find_slot` to log
information for all streams when debug is enabled.

The motivation for this change is that currently if a non-unified
kv-cache is used, then only one stream will be logged because the
code was currently uses `seq_to_stream[1]`.
@danbev danbev merged commit cd3069d into ggml-org:master Aug 11, 2025
47 checks passed
@danbev danbev deleted the find_slot_debug_streams branch August 13, 2025 08:51
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants